Deriving Information Structure from Prosodically Marked Text with Lexicalized Tree Adjoining Grammars

Author

  • Aravind Joshi
Abstract

This paper proposes a method for integrating intonation and information structure into the Lexicalized Tree Adjoining Grammar (LTAG) formalism. The method works fully within LTAG and requires no changes or additions to the basic formalism. Drawing on the existing CCG analysis, we represent boundary tones as lexical items and pitch accents as features of lexical items. We then show how prosodically marked text can be parsed to produce a derivation with the correct semantics and the appropriate information structure for the sentence. Although this paper is concerned with the recognition of prosodically marked text, the method described is also applicable to generation. The system has been implemented and tested using a wide-coverage LTAG grammar. The results also show how an account of intonational structure can be given in a lexicalized grammar with built-in constituencies, as in LTAG, in contrast to lexical systems with flexible constituencies, as in Combinatory Categorial Grammar (CCG).

Submission Type: Regular Paper
Topic Areas: L2. Syntax & parsing; L3. Semantics, pragmatics, cognition
Author of Record: Gann Bierner
Under consideration for other conferences: none
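The representation described in the abstract (boundary tones as lexical items of their own, pitch accents as features on words) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the `word@ACCENT` input notation, the `LexicalItem` class, the function names, and the accent-to-label table, which follows the common convention in CCG-based accounts of intonation that L+H* marks a theme and H* marks a rheme. The sketch only chunks a marked string into labeled phrases; it does not perform LTAG parsing.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class LexicalItem:
    form: str
    accent: Optional[str] = None  # pitch accent carried as a feature of the word
    is_boundary: bool = False     # boundary tone treated as its own lexical item


# Assumed mapping from pitch accent to information-structure label.
ACCENT_TO_INFO = {"L+H*": "theme", "H*": "rheme"}


def tokenize(marked: str) -> List[LexicalItem]:
    """Turn a marked string such as 'Fred@L+H* ate LH%' into lexical items."""
    items = []
    for tok in marked.split():
        if tok.endswith("%"):
            # Hypothetical convention: boundary tones are tokens ending in '%'.
            items.append(LexicalItem(tok, is_boundary=True))
        elif "@" in tok:
            form, accent = tok.split("@", 1)
            items.append(LexicalItem(form, accent=accent))
        else:
            items.append(LexicalItem(tok))
    return items


def information_structure(items: List[LexicalItem]) -> List[Tuple[str, List[str]]]:
    """Group words into phrases closed by boundary tones and label each phrase."""
    phrases = []
    words: List[str] = []
    label: Optional[str] = None
    for item in items:
        if item.is_boundary:
            phrases.append((label or "unmarked", words))
            words, label = [], None
        else:
            words.append(item.form)
            if item.accent in ACCENT_TO_INFO:
                label = ACCENT_TO_INFO[item.accent]
    if words:  # trailing words with no closing boundary tone
        phrases.append((label or "unmarked", words))
    return phrases


if __name__ == "__main__":
    items = tokenize("Fred@L+H* ate LH% the beans@H* LL%")
    print(information_structure(items))
```

In this sketch a phrase takes the label of the last accented word it contains; a grammar-based treatment would instead derive the label through feature unification during parsing.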

Related resources

PreRkTAG: Prediction of RNA Knotted Structures Using Tree Adjoining Grammars

Background: RNA molecules play many important regulatory, catalytic and structural ...

Extraction of Tree Adjoining Grammars from a Treebank for Korean

We present the implementation of a system which extracts not only lexicalized grammars but also feature-based lexicalized grammars from Korean Sejong Treebank. We report on some practical experiments where we extract TAG grammars and tree schemata. Above all, full-scale syntactic tags and well-formed morphological analysis in Sejong Treebank allow us to extract syntactic features. In addition, ...

Automatically Extracting and Comparing Lexicalized Grammars for Different Languages

In this paper, we present a quantitative comparison between the syntactic structures of three languages: English, Chinese and Korean. This is made possible by first extracting Lexicalized Tree Adjoining Grammars from annotated corpora for each language and then performing the comparison on the extracted grammars. We found that the majority of the core grammar structures for these three language...

Things between Lexicon and Grammar

A number of grammar formalisms were proposed in the 1980s, such as Lexical Functional Grammars, Generalized Phrase Structure Grammars, and Tree Adjoining Grammars. Those formalisms then started to put stress on the lexicon, and were called lexicalist (or lexicalized) grammars. Representative examples of lexicalist grammars were Head-driven Phrase Structure Grammars (HPSG) and Lexicalized Tree Adjoi...

Extracting Tree Adjoining Grammars from Bracketed Corpora

Fei Xia (Department of Computer and Information Science, University of Pennsylvania). In this paper, we report our work on extracting lexicalized tree adjoining grammars (LTAGs) from partially bracketed corpora. The algorithm first fully brackets the corpora, then extracts elementary trees (etrees), and finally l...

Publication date: 1998